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The set of doubly-stochastic quantum channels and its subset of mixtures of unitaries are in- 
vestigated. We provide a detailed analysis of their structure together with computable criteria for 
the separation of the two sets. When applied to 0(d)-covariant channels this leads to a complete 
characterization and reveals a remarkable feature: instances of channels which are not in the convex 
hull of unitaries can return to it when either taking finitely many copies of them or supplementing 
with a completely depolarizing channel. In these scenarios this implies that a channel whose noise 
initially resists any environment-assisted attempt of correction can become perfectly correctable. 
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I. INTRODUCTION 

Quantum channels are the most general input-output relations which the framework 
of quantum mechanics allows for arbitrary inputs. Physically, they describe any 
transmission in space, e.g., through optical fibres, and/or evolution in time, as in 
in quantum memories, from a general open-systems point of view. Mathematically, 
they are characterized by linear, completely positive maps acting, in the Schrodinger 
picture, on density operators in a trace-preserving manner. 

The present work investigates the particular class of quantum channels which leaves 
the maximally mixed (chaotic or infinite-temperature) state invariant. These channels 
are called unital or doubly- stochastic (referring to unital and trace-preserving) and 
they appear naturally in contexts with an irreducible symmetry. Apart from their 
practical relevance, the interest in these channels has various origins: (i) they exhibit 
many special properties, e.g., regarding contractivity [l[ or fixed points — often 
allowing for a more geometric intuition, (ii) for small dimensions their additional con- 
straint is strong enough to considerably simplify problems [3] , and (iii) for sufficiently 
large dimensionsproblems on general channels can often be reduced to their unital 
counterparts @, HQ. 

The line of interest taken up by this article concerns the convex structure of the 
set of unital channels and, in particular, its relation to the subset of mixtures of 
unitary channels. This question was addressed and touched upon in 0, [H, Q where a 
crucial difference between the classical and the quantum case was realized: whereas, by 
Garrett Birkhoff 's theorem , every doubly stochastic matrix (describing a classical 
channel) is a convex combination of reversible ones (i.e., permutations), not every 
doubly-stochastic quantum channel has to be a mixture of unitaries. The latter 
set became a stronger relevance when it was realized in Ref.[ll| that a quantum 
channel allows for perfect environment-assisted error correction if and only if it is a 
mixture of unitaries. Another remarkable step was made in Ref.[l3] where evidence 
has been provided that asymptotically many copies of a unital channel might always be 
well approximated by a mixture of unitaries — a conjectured restoration of Birkhoff 's 
theorem in the asymptotic limit. 

An outline of the paper and a summary of its results: 

• In SecHI] we provide two characterizations of unital channels: (i) as channels 
which are convex combinations of unitaries acting on Hilbert-Schmidt space, and 
(ii) as channels which are affine combinations of unitary channels. Moreover, 
we show that extreme points of the set of unital channels need not be extremal 
within the set of all channels. 

• In Scc lini computable criteria for the separation of unital channels from the 
set of mixtures of unitaries are provided and a respective negativity measure is 
introduced. 

• In Sec lIVI we focus on covariant channels (in particular w.r.t. 0(d)) and show 
how symmetry enables us to explicitly determine the above sets and to compute 
the negativity measure. 

• In Sec|V] we apply the acquired tools in order show that families of covariant 
channels outside the convex hull of unitary channels fall back into this set when 
either taking several copies of them or supplementing with a completely depo- 
larizing channel. 
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II. UNITAL QUANTUM CHANNELS 
A. Preliminaries 

We begin with introducing some notation and basic concepts. Throughout we will 
work in the Schrodinger picture and consider quantum channels T with finite and 
equal input and output dimensions, i.e., T : Add — > Aid is a linear map on d x d 
(density)matrices. Complete positivity enables a Kraus decomposition 

T(p) = J2^l $>^ = 1, (1) 

i i 

where the second relation expresses the trace preserving property. A channel is called 
unital if T(l) = 1 and as we include the trace preserving property in the definition 
of a channel, a unital channel is a doubly- stochastic completely positive map. 

It is often convenient to regard Aid as vector space which, when equipped with 
the inner product (A,B) := trL4tj3], forms the Hilbert- Schmidt Hilbert space 9)d- 
Every channel is thus a linear map on this space and has as such a respective matrix 
representation T 6 AAj? — ^(-Q)- 1 We will occasionally use a (non-orthogonal) basis 
for ft which is obtained from embedded Pauli-matrices in the form 

4 k := |i) (k\ + \k) 01 for all j < k 

a( k :=-i(\j)(k\-\k)(j\) for all j < k (2) 
4 := \j) + + Vj = l,...,d-1, 

together with the identity matrix. 

Another useful concept is the state-channel duality introduced by Jamiolkowski [l3T | 
which assigns a density operator px G Md 2 to every channel T via 

d 

PT = (id®r)(|fi)(0|), |n> = -7=5>'>J> 

Vd j=l 

where VL is a maximally entangled state. The states pr corresponding to unital chan- 
nels are exactly those with reduced density matrices 

tn [pr] = tr 2 [pr] = 1/d. (3) 

Note that due to the linearity of the correspondence the convex structure of channels 
is entirely reflected by the convex structure of the set of their dual states. Depending 
on what is more convenient we will switch back and forth between T and pt- 



B. Representations 

In the remainder of this subsection we will prove the following characterization of 
unital channels: 



1 B(X) denotes the space of bounded linear operators on X. 
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Theorem 1 (Characterization of unital channels). Let T : Aid — * M-d be a quantum 
channel. Then the following are equivalent: 

1. T is unital (i.e., doubly-stochastic), 

2. T is a convex combination of unitaries on Sjd, i-c, T — ^2 a p a W a with the p's 
being probabilities and each W a G Aid 2 unitary, 

3. T(-) = ^ i \i Ui ■ U\ is an affme combination of unitary channels, i.e., the X's 
are real and sum up to one and each Ui G Aid is unitary. 

In order to see 1<£=>2 we use a result from [ij: for any p > 1 a positive trace-preserving 
map T is a contraction in the sense 2 of ||T|| < 1 iff 3 T is unital. In addition we 

have ||T|| 2 ^ 2 = HT 1 ]]^ so that T is unital iff T is a contraction with respect to the 
operator norm. The set of these contractions in turn is the convex hull of unitaries 
(as can be seen from the singular value decomposition) which completes 1<=>2. 
As 3=4>1 is obvious it remains to show 1=>3. To this end we introduce 

X := {A e M d : A = A\ txA = 0} , 
V:={Ah UAU f : U eM d unitary} C B{X). 

That is, X is a real linear subspace of containing all Hcrmitian operators orthogonal 
to 1 and V are the unitary conjugations on X . Note that the real linear span of V is 
invariant under composition and that the set in Eq. @ (without the identity) forms 
a basis of X . The idea is now to show first how B(X) can be obtained from V and 
then to extend this to the claimed implication 1=^3 in ThmJTJ 

Denote the subspace of real linear combinations of vectors {x±, . . . ,x n } such that 
the coefficients sum to zero by 

zerospan R {xi, . . . ,x n } := A, x l : A; G M, ^ A; = j . 

Lemma 2. For each basis vector B G X in {2J there exists a T G zerospan R V which 
maps B to itself and all other basis vectors to zero. 

Proof. We explicitly construct such a T w.l.o.g. for a^ 2 . Set 





:>+ 


-\ 




I 4 and a = 


a ■ a 


B* 


B 


C 



la 



T 2 (p) := ~ (p - U 2 pU^j G zcrospan K V, U 2 := | J , 



Then for all «el and a = (cr x , o~ y , o~ z , 1 2 ) 



A := 



a ■ a 








C 



ot\G x + a 3 a z 







2 The norms are defined as ||T|| p _ 9 = sup A \\T(A)\\ q / \\A\\ p with \\A\\ p = (tr[(AtA)f/ 2 ]) 1/p . 

3 As usual 'iff' should be read 'if and only if. 
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In a similar vein we can finally map a>3 to zero by a T3, denned as T 2 only with a y 
in U2 replaced by a z . Then T := T3 o T 2 T\ is the desired operator which satisfies 
T(A) — ceial 2 , and T € zcrospan R V as T 2 G zerospa%V. Clearly, the same type of 
construction works for all basis vectors in |2]). □ 

Lemma 3. For every pair of basis vectors Bi , B 2 G X in @ , there is a T € V swc/i 
i/iai T(Bi) = B 2 . 

Proof. As £>i and £> 2 are Hermitian, there are unitaries U\ and U2 such that UjBjUj 
(j = 1, 2) are both diagonal. These can in turn be mapped onto each other by a 
permutation in V since they both have eigenvalues (1, —1, 0, . . . , 0). Exploiting that 
V forms a group we can compose these steps to obtain T(Bi) = B 2 . □ 

Proposition 4. V zero-spans all linear operators on X , that is, 

zerospan R V = B(X). 



Proof. For any two basis vectors B\ , B2 in ((2]) , by the above lemmas there is a T G 
zerospan R V which maps B\ to _B 2 and all other basis vectors to zero, so that a linear 
combination of these T's generates any linear map on A". □ 

This immediately implies 1=>3 in ThmJT]as for every unital quantum channel T we 
have that T — id G B(X) so that we can write T(p) — p + J2i UipU} with the A's 
summing up to zero. 

C. Extreme points 

The set of all unital quantum channels on Md is convex and compact. That is, 
every unital channel T can be decomposed as 

t = J2p^ ( 4 ) 

i 

where the p's are probabilities and the T^'s are extremal unital channels, i.e., those 
which cannot be further decomposed in a non-trivial way. Despite considerable ef- 
fort @, H, [Hi [IB [3 not much is known about the explicit structure of these extreme 
points beyond d = 2 (in which case they are all unitary conjugations 0). The small 
contribution of this subsection is to review the existing results and to apply them in 
order to show that channels which are extremal within the set of unital channels are 
not necessarily extremal within the convex set of all channels. To the best of our 
knowledge all known examples so far were extremal within both sets — although the 
numerical results stated in [16j already indicate that this might not be generally true. 
The main ingredient is the following theorem which is stated in Q and based on (l7j . 

Theorem 5 (Extremal channels). Consider a quantum channel with Kraus operators 
{^4i}i=i,...,Ar- It is an extreme point within the convex set of quantum channels iff the 
set of matrices 



{4M 
1 j k,i=i... 



(5) 



G 



is linearly independent. Assume further that the channel is unital. Then it is extremal 
within the convex set of unital channels iff 



{AlAtQAiAl} 



k,l=l...N 



(6) 



is linearly independent. 



We will exploit the fact that ([5|) allows less linearly independent operators than (J6j> : 
while ([5]) gives the simple bound N < d, the set ([6]) yields N < \/2d. 4 For our example 
we choose dimension d = 3 and N = 4 linearly independent Kraus operators. The 
former ensures that there are non-trivial extreme points, and the latter already implies 
that can never be linearly independent as N j£ d. We start with an Ansatz for 
the Jamiolkowski state of the sought channel of the form 



pr = (id®T)(|n) (fi|) 



^2 Xij (t/jj 



where the (|V J i))i span the orthogonal complement of (\kk)) k , namely 
| ( ,.} = — (|12) - |2l ) i . |r.,} = — (|13) + |31» , |Vs> = — (|23> 
1 12 



JL(|12) + |21)), |^ 2 ) = -1-(|13) + |31)), IV>3> = ^ 
|^) = -L(|12>-|21)), |V 6 ) = -l:(|13)-|31)), |^,) = -L(|23) 



and the Hermitian matrix X = (xij ) is given by 



X 



V2i 



132)), 
132)), 





/ 


1 

2 







«M3 


1^4 


\ 









l 

2 




-ifi 4 


-(2 + i)A*s 





1 




i/xi 




i 

2 








2 M2 + « 


3 




-i^3 


ip 4 





i 

2 





-*>i 






-i(i4 (i 


-2)M3 








1 

2 






V 








2 fi 2 - i (i 3 




-i/ii 





(7) 



The latter is chosen such that pr satisfies the conditions corresponding to a unital 
and trace-preserving map. It remains to choose algebraic numbers \i\ , . . . , /U4 6R such 
that X is positive semidefmite with rank TV = 4, and that at the same time (|6|) is 
linearly independent when plugging in the corresponding Kraus operators. A possible 
choice for such a set of parameters is provided in appendix I Villi 



III. MIXTURES OF UNITARY CHANNELS 

This section deals with the class of unital channels which can be represented as 

N 

T(p) = PiUiPUh u * u l = 1, ft > Vt. (8) 



4 In fact, in [15| it was shown that TV < y/2d 2 — 1 which is, however, practically the same as TV < \/2d 
when applied to integer TV. 
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The Jamiolkowski states corresponding to these mixtures of unitary conjugations are 
exactly the states which are convex combinations of maximally entangled states. The 
rank of the Jamiolkowski state pr gives a simple bound [18] for the minimal N as there 
exists always a decomposition with N < (rank px) ■ For d — 2we can achieve equality 
in the general lower bound N > rankpp and, as mentioned before, every unital channel 
on Ai 2 is a mixture of unitaries Q- For d > 3 the question whether a given unital 
channel allows for such a representation was investigated and reformulated in [16| but 
a general operational way of deciding it remains to be found. The approach in the 
following subsection provides a class of easily computable necessary conditions which 
when applied to covariant channels will later be extended to necessary and sufficient 
criteria. 



A. Separation witnesses 



Since the set (|8]) of mixtures of unitary channels is convex and compact, every 
unital channel which lies outside this set can be separated from it by a hyperplane — 
a witness. As this can most easily be expressed on the level of Jamiolkowski states 
we introduce the corresponding sets 

S := {p T ■ Pt = (id ® T) (|fi) (fi|) , T : M d -> M d cp, tp, unital} 

= {pE M d 2 : p > 0, titp = tr 2 p = 1/d} , 
W:=conv{(l®J7)|Q)(f2|(l<g)[/t) : UU* = 1} , 

which we will, with some abuse of notation, occasionally also use for channels, i.e., we 
will write T € <S' meaning p? € S. The following shows that we may impose some 
structure on the witnesses — they can be taken from the affine span of U. 

Proposition 6 (Separation witnesses). Let p G S characterize a unital quantum 
channel. Then p S hi, i.e., it is a mixture of maximally entangled states, iff 

tr [Wp] > 

for all Hermitian operators W € M d 2 which satisfy 

tnW = tv 2 W = 1/d, tr [Wa\ > Vcr e U. (9) 

Proof. We have to show that if p £ U, then there exists such a W with tr [Wp] < 0. 
First note that 

X := {A G M d 2 : A = A\ triA = tv 2 A = 0} 



is a real linear space and S — \/d 2 C X . Set p 



1/d 2 e X. Using the Hahn- 



Banach separation theorem d9J, theorem l.C in chapter 1] we find a W € X with 



tr 



Wp <-l/d 2 , but tr Wa > -1/d 2 Vct G (U - 1/d 



Setting W :=W + 1/d 2 yields the sought witness. 



□ 



To simplify matters we will in the following also consider Hermitian witnesses which 
do not fulfill the l.h.s. of ([9]) as long as the r.h.s. is satisfied. A class of this kind which 
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turns out to be particularly useful are operators constructed from the flip operator 
F : \k, 1} i-> \l, k) in the form 

W= (l®S)F(l(g)S t ) +w{B)l, BeM d , (10) 

where w(B) g R is a constant depending on B such that W fulfills the r.h.s. in (J9j> . 
Before we determine this dependence let us note that replacing (10 B) by (A®B) in 
Eq. (p~0|) won't lead to a more general class of witnesses since (A ® E) F (A' ® Stj = 
(l ® #4*) F (l (g) ABt) . 

The sharpest constant w(B) for which pop fulfills the witness condition tr[W / p] > 
for all p £ U is obtained from 

w(B) = - mintr [(1 <g> S)F (l <g> B f ) (1 ® E7) (fi| (l <g> t/*)] 

= - -mintr \B^UB T U\ 
d u 1 J 

= -iminjtr [AA] : cr(A) = a(B)}, 

where £/ is unitary, A G Aid and er(yl) denotes the singular values of A. We solve 
this matrix optimization problem in appendix IVII Al arriving at the following result. 

Theorem 7 (Tight witnesses). For any B £ Md with singular values a\ > . . . > ad 
the operator in Eq. (I10p is a separation witness iff 

1 f ^ Xa=i ^i— i02»i ^ ewen 

w(S) > - < / (d _i )/2 \ 2 (11) 



Note in particular that for B = 1 and d odd we get u> > 1 — 2/d while for d even 
u; > 1. Hence, for even d no channel is separated from 11 by such a witness (since 
F + 1 > 0). However, we will see in Sec II V Bl that for d odd it becomes a powerful 
tool. 



B. A negativity measure 

There are several possible ways of quantifying the deviation of a channel T g S\U 
from being a mixture of unitary channels: one may for instance follow [l6l |. use 
the entanglement of assistance fl2L H3] or the minimal distance to the set li w.r.t. 
some distance measure. The representation ThmQ] enables a very natural alternative 
approach — a base norm (inspired by plj). That is, the deviation is quantified by the 
smallest negative contribution when representing T as an affine combination of terms 
in U. More formally: 

Definition 8 (Negativity). For all p € S the base norm associated with U is 

\\p\\ u := inf {a p + a n : p — a p a p - a n a n , a p . n > 0, cr Pi „ eM}, 

and the corresponding negativity is given by 

Nu(p) ■= inf {a n ■ p = a p a p - a n a n , a p . n > 0, <7 p , n eW}. (12) 

For tr[p] = 1 the two are related via \\p\\ u — l + 2Nu(p) and obviously Mu (p) = iff 
p € U. The base norm behaves nicely under concatenation and convex combination. 
Writing ||T||„ := \\p T \\ u we get 
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Proposition 9. Let Tj G S be a set of quantum channels and pi > probabilities 
then 

ln r *L^nii r <iiw> md 

II 1 1 IA 

Both can easily be proven from the definition. The latter can be interpreted as 
coming from triangle inequality and homogeneity of the norm. Note also that the 
above norm is unitarily invariant in the sense of ||TV|| W = ||VT|| W = \\T\\ U for every 
unitary conjugation V. 

As always measures are easy to define but hard to compute. For covariant channels 
we will show the calculation in Sec. IIVBI 



IV. COVARIANT CHANNELS 



In order to arrive at more explicit results we need some help — coming in the form 
of symmetries imposed on the channels. Consider any subgroup Q C U{d) with 
elements g € Q and two unitary representations V g ,V g on C d . We say that a channel 
T : Aid — > Aid is ^-covariant w.r.t. these representations if for all g G Q: 

T(V g ■ V}) = V g T(.)V}. (13) 

In this sense the action of the channel 'commutes with the symmetry'. If V is an irre- 
ducible representation then T is unital as T(l) = / dg T(V g V^) = J dg V g T(l)Vj = 1 
by invoking Schur's Lemma (where dg is the Haar measure). In order to express 
Eq. (fl3|) in terms of the Jamiolkowski state pr we introduce G = {V g <g> V g } g ^g and 
its commutant G' = {X G Aid 2 '■ V U g G G : [X, U g ] — 0}. Covariance of the channel 
translates then simply to 

p T G G'. 

As we will see below most of the analysis can w.l.o.g. be restricted to this commutant 
which considerably simplifies matters as dimG' is for a sufficiently large symmetry 
group much smaller than d , the dimensionality we would have to deal with otherwise. 
The map 



P(A) := 




defines a projection in B(Aid 2 ), often called twirl, which maps every matrix A into G' 
and acts as the identity on G' . Moreover, since G' is an algebra it is spanned by a set 
of minimal projections {Pi}. These are orthogonal if G' is abelian (which happens for 
large enough symmetry groups) so that every X G G' can be written as X = J2i x iPi 
with Xi — tr[XPj]/tr[Pj]. In this case we can easily determine 5 



5 Here we have used that tr[P(A)P l ] = tr[AP(Pi)] = tr[APj]. 
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If G' fails to be abelian a similar reasoning still applies — for a detailed exposition of 
these matters we refer to [21]. In order to see how covariance helps for our purposes 
let us denote the set of witnesses by W := {W = : Ver eU : tr[Wa] > 0}. 

Proposition 10 (Reduction to the commutant). Let p E SC\G' be the Jamiolkowski 
state corresponding to a covariant unital channel. Then p ElA iff tr[Wp] > for all 
W E Wn G'. Moreover, 

\\p\\ u = inf {a p + a n : p = a p a p - a n a n , a p>n > 0, a p>n E U f~l G'} , 

which equivalently holds for the negativity Mu ■ 

Proof. The crucial point for both parts is that a E U implies P(a) E U which in 
turn means that P(W) E W for every W E W. Therefore due to tr[pP(W)} = 
tr[P(p)W] = tr[pW] the set W can w.l.o.g. be restricted to G'. Regarding the base 
norm we arrive at the stated result when starting with any optimal decomposition 
p = a p o~ p — a n o~ n and applying the twirl to both sides of the equation. □ 

This suggests the program for the next subsections: fix a symmetry group, identify 
the commutant G' and determine U, \\-\\ u and Mu by exploiting the reduction to G'. 

A. O(d) covariance 

The symmetry we will consider is the one of the real orthogonal group, i.e., G = 
{O ® O : O E M.d real orthogonal}. The most prominent non-trivial example of a 
channel having this symmetry is 

T(p) = {tT[p]l-p T )/(d-l), (15) 

which (for d = 3) gained some popularity as a steady source of counterexamples: for 
the multiplicativity of the output p-norm [22| . the additivity of the relative entropy 
of entanglement [23| and, most relevant in our context, the quantum analogue of 
Birkhoff's theorem 0]. On the level of Jamiolkowski states we can make use of the 
analysis in [23| where the commutant G' was shown to be abelian and spanned by 

G' = span 1 1, 1, f | , 
where F := d |0) From there the minimal projections can be identified as 

p = ±F = \n) (0| 

d 

Pi = \ (1 - F) 

P 2 = i(l + F)-iF 

where (l±F)/2 are the projections onto the symmetric and anti-symmetric subspace, 
respectively. Consequently, every density operator in G' is in the convex hull of the 
corresponding normalized density matrices pi — Pj/tr[Pj] of which p\ corresponds to 
the Werner-Holevo channel in (fT5|) . p is the ideal channel and J^i Pi/d 2 corresponds 
to the completely depolarizing channel T(p) = tr[p] 1/d. Clearly, all of them are 
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(?) 




FIG. 1: The set of orthogonal covariant channels PS in the Jamiolkowski representation 
(outer/green triangle) and the convex hull of unitary channels PU (blue/shaded area), which 
is described in analytic terms by proposition 1121 Note that the Werner-Holevo channel 
P- is "furthest away" from the unitaries. The orange line (from p_ to p+) depicts the 
U(d) covariant channels, and the dotted unit-square corresponds to entanglement-breaking 
channels. Compare with Ref. [23l . Fig. 2]. 



unital, i.e., elements of S. Every state p G G' is completely characterized by its 
" coordinates" 



(F),(F>) = (tr[pF],tr 



pW 



Especially for the extreme points pt we obtain (see Fig. [T]) 



state 


Po 


Pi P2 


coords 


(M) (- 


-1,0) (1,0) 



(16) 



B. A complete picture 

We will now determine the subset U of mixtures of unitary channels within the 
set of 0(<i)-covariant channels. Following the above considerations this amounts to 
identifying the corresponding region in the two-dimensional parameter space 



WnCs/(<F>,(F>) p : pell} 



(17) 

{((F), (F>) : U unitary}, 
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where the index U stands for the expectation value w.r.t. (1 <g> U) |f2) which parame- 
terizes an extreme point within hi. A short calculation reveals that 



(F)^ = tr [(1 <g> U) |fi) (fi| (1 <g> U ] ) F] = i tr [UU] 



<F> f 



tr 



(1 U) |Q) (fi| (1 ® f] = - |trf/| 5 



(18) 



The picture depends crucially on whether d is even or odd. 



Theorem 11 (Even dimension). // d is even then U n G' = SO G' , i.e., every 
0(d)-covariant channel is a mixture of unitary channels. 

Proof. It suffices to note that the expectation values (|16ll8p with respect to pi and 
Ui coincide for Uq = 1, Ux = diag (a y , . . . , a y ) and XJi = diag [a z , . . . , cr z ), just by 
plugging in (fT8| . □ 



So the interesting structure only emerges for d odd (see Fig. [T] for d= 3), for which 
we need the following result proven in appendix IVIIBI 

Proposition 12. Let d > 1 be odd. Then for all x G [— l + |,l] there exists a 
unitary U £ U(d) such that tr [UU] /d — x, and 



max{|tr[/|/d : U G J7(d), tr [UU\ /d = x) 



1/2 



m(x) 



(19) 



Theorem 13 (Odd dimension). Let d > 3 be odd. Then the extreme points of the 
set (|17p corresponding to mixtures of unitary channels are 

(-l + 2/d,0), (1,0) and Ux, d(m(x)) 2 ) : x G [-1 + 2/d, 1]| . (20) 

Lroo/. "(TlT]) C conv ([2^|l ": For all unitary U G U(d), 

\tr[UU] G [-l + 2/d,l], 

which follows from the fact that for any matrix A the spectrum of AA is symmetric 
with respect to the real axis, the eigenvalues A, A have the same algebraic multiplicity, 
and the algebraic multiplicity of all negative eigenvalues of AA (if any) is even, see [24| . 
Together with Prop. [TJ] we obtain the stated bounds on (fT7|) . 
"O C (HZD": Set 



Qo 



and ip := exp (27ri/3), then the coordinates ([2"D"|) are obtained by 

Uq = diag (cTj,, . . . , a y , Qo), U\ = diag (a z , . . . , a z , ip, ip 2 , l) and the unitary matrices 

which solve the maximization problem (|19[) (explicitly given in appendix IVII B[) . □ 
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The fact that according to Prop. [101 we can restrict to decompositions within the 
two-dimensional parameter space, together with the explicit characterization of the 
set U HG' enables us now to compute the negativity Nu, as follows. 

We show hrst that in Eq. Ijl2|) . a n = P2 always obtains the infimum, as illustrated in 
Fig. [2 Since UDG' is convex and closed, the optimal a p . n in (|12p are on the boundary 
of U n G', and a n lies either on the segment joining p 2 and po or the one joining p2 
and the covariant state with coordinates (— 1 + 2/d, 0). We may w.l.o.g. assume 
the former, i.e., a n = Xp + (1 — \)p 2 for a A G [0, 1]. Considering decompositions 
p = a, p <j p — a n a n with optimal a p (depending on A) given p and a n via A, both 
a Pi n are already determined by the ^-coordinates of p and a Ptn due to a p + a n = 1. 
Note that the x-coordinates of p and a n remain fixed for all values of A whereas the 
^-coordinate of a p is non-increasing as A decreases, and so is a n . That is, A = or 
equivalently <j n = pi minimizes a n . 

It follows that a uniform scaling of the boundary of U n G' by a factor (1 + a n ) 
starting from p 2 as origin yields precisely the set of points with negativity a n . 

We may write each p £ S \ U (1 G' in terms of a convex combination of the pi listed 
in table [ini that is, p = J2i 1i Pi w i tn 1i ^ an d J2i 3» = 1- From Fig. [5] it is evident 
that qi > for all p <^U. Set q := qo/qi and distinguish the following two cases due 
to the particular shape of U D G'. 

• q > g7gzrn • This corresponds exactly to the area above the dashed line in Fig.O 
Applying the scaling (1 + a n ) to the curve in theorem !13l an explicit calculation 
shows that 



^ = dh ( d - 1 + { d+ dh)«- 2d d^2^ 2 + d cr 1 i (1 K- 1 ' 



3 



P : 



^qiPi£S\U with q t > 0, }^qj = 1, q:=q /q\. 

i—l i 

• <Z < cUd—i) • ^ n ^ ms case the negativity does not depend on q, and we get 

Mu(p) = -j^jii - 1. 

In particular, Afu(p) is maximal exactly for the Werner-Holevo channel p_, namely 
Afu( P -) = l/(d-l). 



V. RESTORING BIRKHOFF'S THEOREM 



Measures quantifying the deviation of a unital channel from being a mixture of 
unitary channels are known to be not additive (or multiplicative). That is, a naive 
extrapolation from the 'distance' between a given T £ S \ U and U typically leads 
to an overestimation of the respective quantity for T® n , i.e., several copies of the 
channel. This effect was studied in detail in the context of the entanglement of 
assistance (l2l . [20| 

E a {p) :=sup|^p i S'(tr 1 * i ) : p = J2Pi\^i)(^i\\ > 
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<F> 




FIG. 2: Negativity as distance measure, exemplified by orthogonal covariant channels. Mu 
is constant along red lines, which are obtained - in geometric terms - by a uniform scaling of 
the unitary channel boundary about p2 as origin. a p . n £ PU are the optimal states in (|12|l 
given p£ PS\PU. 

where S(p) = — tr[p\ogp] is the von Neumann entropy, and the supremum has to be 
taken over all convex decompositions of the given state p € B(C d ® C d ) into pure 
ones. As S (triSf^) < logd with equality iff ^ is a maximally entangled state we have 
that E a (p) < logd with equality iff p E U. It was shown in [12J that 

VpeS: lim -E a (p® n ) =logd, 

n — ^oo Jl 

which suggests that the approximation of p® n by an element of U improves as n 
increases. This would mean a restoration of Birkhoff's theorem in the asymptotic 
limit. Whether this statement is valid in general when formulated in terms of norm 
distances (either for channels or, supposedly weaker, for states) remains an open 
problem [25j |. 

In the following subsections we will prove it in the strongest possible sense for a 
class of 0(d)-covariant channels. We will see that at least for these cases neither 
the asymptotic limit nor an approximation is required — a remarkable effect from the 
perspective of environment-assisted error correction fSec IV Cj) . 

More specifically we will show that for a T $ U we find T ®T €U when choosing 

T : Ma — > Ma, T(p) = ^Atr[p]l-5p T , d odd, (21) 
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with appropriate 6 and either f = T (Sec IV A|) or f : p £ Md >-> tr[p] 1 e Md 
completely depolarizing fSec lVBj) . The symmetry of the channels will help us in two 
stages: (i) we can use Thmll3l which tells us that T for 6 > l/(d+ 1), and (ii) it 
circumvents having to find an explicit decomposition in terms of unitary channels for 
T ®T: if the convex hull of the relevant G expectation values of any set of unitary 
channels contains the ones of T ® T then the twirling projection P does the rest of 
the job. 



A. Two copies of a channel 



As usual we switch to the Jamiolkowski representation, where the family of channels 
in Eq. (|2"Tj) becomes 



in .= (1-1 + 1),,. 



3" 1 



d odd 



where p± are the normalized projections onto the symmetric and anti-symmetric 
subspace, respectively. The parametrization is chosen such that px € S \ U for 
e€ (0,2/d] since 

2 

tr [pr¥} = -l + --e, 

and F + (1 — 2/d) 1 is a tight separation witness according to Thms I7TT51 To exploit 
the full symmetry coming from f = T we follow section V.B of [23j and increment 
the tensor product symmetry group 6 {(U ® U) ® (V ® V)} by a flip operator which 
interchanges the tensor factors in the product T®T. This results in a larger symmetry 
group G, thus yielding a smaller commutant G' C .M^ which is spanned by 1 and 



F 



(1®F + F® 1), Fi 2 :=F®F. 



That is, every state p € G' is now completely characterized by the expectation val- 
ues/coordinates 

((F),(F 12 )) p = (tv [pF],tv[pF 12 ]). 

Especially for any unitary channel described by U € .Md 2 , setting U s := | (U + U T ) 
and denoting partial transposes by Ti gives 

(F 12 ) u = tr [F 12 (1 ® U) p) (fi| (1 ® C^)] = i tr [C/Z7] , 



(F)^ = tr [F (1 ® U) |0) (1 t/ f )] 



2d 2 



tr 



f/(Z7 Tl +I7 T2 ) 



(22) 



17* C// 2 



The aim is to use the full symmetry group. That is, we use V Cg> V, V S U(d) for px which already 
allows us to discard F from the commutant. We use further that the commutant of a tensor 
product is the tensor product of the commutants and that Prop. [10] remains true when adding the 
additional flip operator. 



1G 



— T2 

The last equation uses the fact that U s is again symmetric. The ranges of the 
expectation values in Eq. (|22]1 are studied in appendix IVII CI In particular for d = 3 
we provide an explicit construction for the coordinates 

((F), (*i 2 »* = i (-^(costf + l) 2 + 3, 16 cos 2 t? - 7V tfe[0,7r/2] (23) 

corresponding to convex combinations of unitary channels. Now matching the coor- 
dinates of T (g) T, 



((F),(F 12 )) T = (tr[p T F],tr[p T F] 2 ) = (x,x 2 ) , x := -1 



with (J23J) yields 

2 



c 



-(4-3\/2-V3 + V6 
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as shown in Fig. [3] The blue area corresponds to convex combinations of unitary 
channels 7 , i.e., elements of U, the orange curve to coordinates of single-channel tensor 
products and the red part of this curve to the elements of U on the single-channel 
level. The state at the lower corner is 

1 / 

Pro = j (P~ ® P+ + P+ ® P-> ■ 

As can be seen from direct inspection, each point on the curve (123[) is an extreme 
point of the blue area. The remaining extreme points (1, 1) and (1, — |) are realized 

by the unitaries 1 and ( 1 _1 )® 4 © 1, respectively. 



B. Help from a noisy friend 



Instead of adding a second copy of the channel, we will now supplement it by 
a completely depolarizing T. The Jamiolkowski representation of the completely 
depolarizing channel f : B (K\) — > B (/C 2 ) , Ki = C D is 

Pi = (id®f)(|fi) (fi|) = 1/D 2 . 

Let H be the corresponding symmetry group of all local unitaries V\ <8> Vi with V% G 
B (/Cj), then H 1 = span{l}. Using again that 

(G <g> H)' =G'®H' 

(see example 7 in section II. D of (23|) we get that every element of the commutant is 
completely characterized by the expectation value of Y = F^ 1 g,^ 2 ® lyc 1( 8K:2! yielding 

tr [(pr © p t ) y] = tr [ PT ¥] =-l + l- e . (24) 



7 A proof that (1 23 I t really solves the minimization problem (132 1) as acclaimed is still outstanding, 
but strongly supported by numerical tests. In any case, the set of convex combinations of unitary 
channels is at least as big as the blue area. 
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FIG. 3: Twofold tensor products of covariant channels for d — 3 (outer/green triangle). The 
mixtures of unitary channels correspond to the blue/shaded region, the orange parabola to 
tensor products of a channel with itself and the red part of that curve to the elements of U 
on the single-channel level. Compare with Fig. 9 in [231 ]. 



Since —1 < Y < 1 every normalized state p satisfies tr [pY] £ [—1,1]. In order to 
obtain the subinterval of [—1, 1] covered by convex combinations of unitary channels, 
we have to calculate (Y) u for unitary U : Hi ® K,\ — ► H2 <8> IC2, 



tr [(1 <gi U) |fi) (l® E/ f )F] 



1 

dD' 



-tr 



UU 



■T-2 



(25) 



As U = 1 reaches the upper bound 1, the hard part is the lower bound which is 
treated in appendix IVII Dl The results suggest that for D = 2, Eq. (|25|) gives 



(26) 



for the range in which T ®T £ U while, recall, T £ U only within [-1 + 2/d, 1]. 
The interval (|26p can be related to the conjectured existence of a certain quaternion 
matrix, which we construct explicitly for d = 3 and d = 5. This means that in this 
case (|25l) covers this range at /east In particular, for e < 2(d— l)/d 2 , the expectation 
value (|24[) lies within this interval such that px <g Pf becomes then indeed a convex 
combination of maximally entangled states. 

For higher values of D, we reproduce table HT1 from appendix IVII Dl 
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d 


D = 1 


2 


3 


4 


5 


l 


1 


1 


1 


1 


1 


3 


i 

3 


7 

9 


23 
27 


_ 4 
5 


37 
45 


5 


3 
5 


23 
25 


-0.929 . . 


-1 


-0.976... 


7 


5 
7 


47 
49 









TABLE I: Numerical minimization of (|25p . Note that the table is asymmetric w.r.t d <-> D, 
and that the lower bound —1 can apparently be obtained for d = 5 and Z? = 4, that is, each 
channel-tensor product T ®T (with T the completely depolarizing channel) becomes then a 
convex combination of unitaries. 

C. Environment-assisted error correction 

The above results become especially remarkable from the point of view of 
environment-assisted error correction — a concept introduced in [Il(. There it was 
studied which channels allow complete correction, given a suitable feedback of classi- 
cal information from the environment (see Fig. The class of perfectly correctable 
channels was identified with the set of convex combinations of unitary channels. In 
this way the above observations yield examples of channels which are not perfectly 
correctable on their own but become so when either taking several copies, or supple- 
menting with a completely depolarizing channel. 




FIG. 4: The correction scheme as in applied to the simultaneous usage of two noisy 
channels T : B(Hi) — > B{7i2) and T : BQCi) — > B(K,2)- The channels are represented by 
unitary couplings U and U to an environment which is initially in a pure product state. The 
classical result a of the measurement on the global environment is used by the receiver who 
chooses the recovery operation R a (again a quantum channel) accordingly. As discussed in 
the text, T ®T can become perfectly correctable (i.e., T corr = id) although neither T nor T 
is so. 



VI. DISCUSSION 



The presented investigation of the set of unital quantum channels is to a large extent 
based on and inspired by methods and ideas from entanglement theory. The tools 
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acquired in this context could be directly applied to the Jamiolkowski representation of 
the channel. This approach as such leads to questions about further analogies between 
the two fields. It would in particular be interesting whether a useful counterpart to 
positive maps, i.e., powerful non-linear criteria can be found for the separation of the 
set of mixtures of unitary channles 8 . 

Clearly, the asymptotic Birkhoff conjecture [25| remains an important open problem 
for which the present work might be regarded as supporting evidence as it provides 
the first class of examples for which there is a rigorous proof. In this context it might 
be interesting to investigate T ® Tn with Tn a D-dimcnsional maximally depolarizing 
channel, as studied in Sec lVBl Is there a dense subset of unital channels such that a 
finite D makes T (£> T-q a mixture of unitary channels? 
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VII. APPENDIX A-MATRIX OPTIMIZATION PROBLEMS 

A. Minimizing tr[AA] subject to fixed singular values 

We solve the minimization problem posed in section IIII Al which handles a.o. a 
special class of separation witnesses. Considering any complex d x d matrix A, we 
denote its singular values by o~i(A) (counting multiplicity) such that o~\(A) > •• • > 
ad ( A) . Our main result is the following proposition, which is similar to theorem 7.4.10 
in [2f|. 

Proposition 14. Let A £ Md and ax > • • • > ad denote the singular values of A. If 
A A is Hermitian, then there exists a permutation r of {1, ... ,d}, an even r < d and 
a function p : {1, . . . , r/2} — ► {0, 1} with 

r/2 d 

tr [AA] = 2]T(-1)^> a T(2l ^ 1} a r(2t) + £ a% y (27) 

i=l i=r+l 

Conversely, given any such r, r, p and nonnegative numbers o~\ > • • • > there exists 
an A £ Md such that cri(A) — Oi for all i and J27|) holds. 

Proof. We split the "=>" part into the following steps: 

1. tr [^4^4] and the singular values of A are invariant under A i— > UAU T for any 
unitary U. Note that this map sends AA UAAW, so by the spectral theorem, 
w.l.o.g. AA real diagonal. 

2. It follows that AA = AA = AA, i.e. A commutes with A, and each of the 
eigenspaces of AA is invariant under A and A. Stated differently, A is block 
diagonal, each block corresponding to an eigenspace of AA, so w.l.o.g. AA = A 1 
for a A e K. 



Here it might be helpful to replace positivity by some form of contractivity. 
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3. The case A ^ 0: Applying the singular value decomposition yields unitary 
matrices U, V such that A\ := U^AV = diag (oi, . . . , ad). Using A = \A~ X , we 
have 



A 2 :=V*AU = \(U*AV) 1 = A^ 1 = A diag (of 1 , a 



A\ and A<i sharing the same singular values translates to {ai, . . . , er^} = 
|A| {cj -1 , . . . , cr^" 1 }, so there is a permutation r of {1, . . . , d} with 

| 0"t(2»-i) 0V(2») = |A| for i = 1, . . . , |, d even 

\ 0V(2»-i) 0V(2i) = |A| for i = 1, . . . , ^fi, o^ (d) = |A| , d odd. 

Note that d cannot be odd if A < as the negative eigenvalues of AA are of 
even algebraic multiplicity (see e.g. [26j], pages 252, 253). Concluding, tr [AA\ 
can always be written in the form (|2T[) . 



4. The case A = 0: Let r denote the number of nonzero singular values of A. 
AA = means that range (~A\ C kern(A) , so 

r = rank(A) = dim range (A) < dimkern(j4) = d — r, 

i.e. 2r < d, and there is a permutation r such that each summand in the right 
hand side of (j27|) is zero. 

To prove the "=>" part, set A := diag (Ai, . . . , ^4 r /2, Cr(r+i); ■ ■ ■ 7 °V(d)) with 

□ 

Corollary 15. Given any nonnegative numbers <j\ > ■ ■ ■ > <Jd, 
min {tr [AA] : A G B (H) , a^A) = ffi Vt} 

_J -S^i^i-iffa, d even (28) 

4 



" 2 Yj d i=i /2 °2i-\On + a\, d odd. 



Proof. What remains to be shown is AA being Hermitian for optimal A; then propo- 
sition [TH guarantees optimality. Exploiting invariance under A 1— > VAV T for unitary 
V we can w.l.o.g. assume that A = UD with D — diag (a±, . . . , o^) and E7 unitary. 
Now vary U to minimize tr [A~A\ ; the unitary constraint translates via 

Z7 dt/t + d[/[/t = d(C/C/ t ) =0 

to X := idUU^ being Hermitian. We have 

= -^-tr \DUDU] 
dil 1 J 

= 2Rctr [DdUDU] 
= 2Imtr [J/DCfDX] . 

This has to hold true for any Hermitian matrix X. Decomposing UDUD = B\ +iB2, 
B\ and B2 Hermitian, it follows that B2 = 0. □ 
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B. Maximizing |tr[[/]| subject to fixed tr [[/[/] 

In this subsection we prove proposition [T2l from section llVBl i.e. we calculate the 
analytic solution of max|tr[/| for fixed tr [UU\ over all unitaries U £ [/(d). The 
motivation for this optimization problem comes from Eq. (|18j) . which characterizes 
the convex hull of unitary channels within the set of orthogonal-covariant channels. 

We need the following lemma first. 

Lemma 16. Let U £ U(2) be a unitary 2x2 matrix and U s := | (U + C/ T ) the 
symmetric part of U . Then the trace-norm of U s equals 



H^ll^ ^{U s ) = ^tt[UTj]+2. 

3=1.2 

Proof. There are a, j3 £ C such that up to an unimportant phase factor 



a (3 



U = [ >_j with \a\ 2 + \[3\ 2 = 1. 

Direct calculation shows that 

tr [UU] = 2 (|a| 2 - Re (/? 2 )) = 4 (|a| 2 + (Im/3) 2 ) - 2, (29) 

U B = 



a i Im P 
i Im P a . 



U s Ut = (|a| 2 + (Im/3) 2 ) 1, (30) 
£ a, ([/ B ) # 2 V / |«| 2 + (Im/3) 2 ® ^tr [[/77] + 2. 



3=1,2 



□ 



Proposition 17. Let d > 1 be odd. Then for all x £ [— 1 + i/iere exists a 

unitary U £ f/(d) such that d _1 tr [f7f7] = x, and 

max{d _1 \tvU\ : U £ 17(d), d -1 tr [[/U] = x) 



2 \ d/ \ d 

1/2 



1/2 ! (31) 



Proof. Set a := ^ [± (l - ± ) (l - § + z)] 7 e [0,1], /? := y/1 - a 2 and cr := 

(-°/3a)' then a short calculation shows that U := diag (cr, . . . , <r, 1) satisfies 

d _1 tr [UU] = x with d" 1 |tr[/| equal to the right hand side of (f3"Tj) . So what re- 
mains to be shown is an upper bound on d _1 |tr U\. 

Let U £ U(d) be a unitary matrix with d _1 tr [UU~\ = x. By the Youla theorem j24j], 

given any conjugate-normal matrix A (that is, AA^ — .At A), there exists a unitary 
V such that VAV T is a block diagonal matrix with diagonal blocks of order lxl 
and 2x2, the lxl blocks corresponding to the real nonnegative eigenvalues of AA 
and the 2x2 blocks corresponding either to pairs of equal negative eigenvalues of AA 
or to conjugate pairs of non-real eigenvalues of AA. Applying this to [/, there is a 
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unitary V with U — VDV T , the block diagonal matrix D as described. Let § be the 
number of 2 x 2 blocks (with even r < d) and denote these blocks by Di. As U and 
V are unitary, so must be D, i.e. Di 6 J7(2) for all i. unitary guarantees |A| = 1 
for each eigenvalue A of UU, so each real nonnegative eigenvalue of UU must be 1. 
Altogether we have D = diag (D u . . . , Dg, 1, . . . , l). Set C, := \ tr [AA] G [-1, 1] 
and D 5 := | (Z3 + Z3 T ). Using Lemma [TBI and the fact that V T V is unitary and 
symmetric, 

d 

|trZ7| = |tr [W T y] | = |tr [AV^V] | < ^ ^ (L> 5 ) 

3=1 

r/2 r/2 



= ^^a 3 (A, 5 ) + (rf-r),2.^J^ + ( i-, 

i=l j=l,2 i=l 

Let c := d ■ x G [— d + 2, d]. Some elementary analysis shows that the problem 



r/2 



max 



r/2 

subject to 2 • Ci + d — r = c, 

i=i 

Ci 6 [—1, 1] for all i, 
r even, r < d 

has optimal solution r = rf — 1, Cj = ^Ej^* an d the obtained maximum is 

1/2 

d [| (1 — 2) (l — I + 3)] + 1- This upper bound on |tr{7| corresponds exactly 
to the right hand side of (|3Tj) . □ 



C. Minimizing tr[Z7 s f/ s T2 ] subject to fixed tr[f7I7] 
Motivated by Eq. ((22J in section IVAl we investigate 



did: 



-tr 



ct.ct; 



-T 2 



U EB(H®K) unitary with — — tr [f/U] = y 

d\d2 



(32) 



for Hilbert spaces TL and /C with dimensions di and d2, respectively, and provided 
y G [—1 + 2/did-z, 1], where di > 3 is odd. 

The partial transposes Tl and T2 are defined w.r.t. a fixed product basis by the 
linear extension of 

{A ® B) Tl = A T ®B and 

{A(g>Bf 2 = A(g> B T , A®B eB{n®K), 

respectively. Note that for any A and B, 



tr 



AB 



tr [A Tl B ] ] , 
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and for any real or complex A with A T = A, the partial transposes are on equal 
footing, i.e. A Tl = A T2 , so (|32[) is inherently symmetric with respect to d\ <-> d 2 . 
We identify B(7i(8/C) = C dld2Xdld2 by means of the ordered computational basis 
(|ll>J12>,.. ; |ld 2 >,...|did 2 >). 

All quantities in (|32p . especially the minimizers, will stay invariant if we send 

U -> (Wi <g> W 2 ) [/ (Wi ® iy 2 ) T 
with arbitrary unitaries Wi G B{H) and W 2 S B(/C). 



Since every unitary matrix is also conjugate-normal (that is, U If = WU), the 
Youla-theorem 9 states that there exists a unitary matrix V 6 B(TL® JC) such that 

U = VDV T 

with Z? real block-diagonal and blocks of size lxl and 2x2, the former non-negative 
and the latter of the form (% ~ a z ) with a > 0. Since D must also be unitary, this 
equals ( cos t - sin ?) for a ■& £ R, and all 1 x 1 blocks are 1. Note that 

n V sin $ cos v ) ' 

£4 = V D s V T , D s := - (D + D T ) > diagonal, 

in particular, D 5 contains the singular values of U s . Moreover, tr \ UU] = tr \DD\ 
is independent of V, so D fixes y in (|32p and we may freely vary V. Conversely, 
Takagi's theorem [26| asserts that every complex-symmetric matrix A e C™ x " can be 
decomposed into 

A = V diag (<7i , . . . , <7„ ) (33) 

with unitary V and ct^ > for all i, so identifying A = U s and diag (ci, . . . , a n ) = 
D Sl the minimization problem (I32|) can be reduced to the following problem and a 
subsequent optimization over D s : 

AeB(H®K,),A T =A,a l {A) = (j l -ii\. (34) 



{ did2* 


AA T2 







This closely resembles (|28p . and we have effectively decoupled the target function 
from the peculiar unitary constraint in (|32|) . 



Proposition 18. Every (local) minimizer A of (|34|) satisfies A A 2 Hermitian. 



Proof. Denote the derivative w.r.t. V in (|33|) by dV^; since V is unitary, 

V dV^ + dV V 1 = d (V V^) = 
so X := idVV^ must be Hermitian. Plugging 
d 



dV 



A = dV diag (a t ) V T + V diag (a t ) dV T = i (XA + AX 1 



9 Refer in particular to Thm. 4 in 12411 . The Youla-form corresponds to the Schur-form w.r.t. unitary 
congruence transformations A >—> V AV T and is a generalization of Takagi's factorization |26H. 
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into the target function (|34|) yields 
d 



dV 



:tr 



A/ 2 



= i tr 
— i tr 
= 2itr 
= 2itr 
= 2 itr 



A (1A + AX T ) 

■T 2 



■T-2 



XAA 



— 2 tr 



A" f A ~A Fl — A T-1 ~A 
X ( AA T ' 2 - h.c 



= 0. 



T 2 

As this must hold for any Hermitian X, the last equation can only be fulfilled if A A 
is Hermitian. □ 

It is instructive to rewrite the target function as follows, setting a := (<ti, . . . , cr^do)- 
Denote the columns of V by i.e. V = {v\ \ . . . |wdid 2 )i then A = ^ f Cj WiW^ and 



did: 



-tr 



Ai4 



■T 2 



(<r|G(r>, 



G = (gij) Hermitian with 



(35) 



3U 



did 2 



-tr 



V j V j ViVi 



■To 



Writing Vi =: a;^, x^. <E K the last expression becomes 

& "77 E tr [(10 M ® ^I'^ii) (j fc '> (*l ® O^tt)* 
1 2 fc.fc' U' 



T t 

Xjk X ji Xik 



— ^ <a; ifc | a;,-,) {xu \ x jk ) 



did 



did- 



k.l 



"tr [4] 



(( x ik\ x 3l))k,l=i,...,di 



V being unitary translates to trs^ = ^ fc (xik \ Xjk) = (vi \ Vj) = Sij. 

In what follows, we provide an explicit upper bound 10 of (|32[) for di = d 2 =: d and 
d = 3. Start with the Ansatz that all 2 x 2 blocks in D belong to the same phase, i.e. 



D = 



cos i? — sin 
sin d cos $ , 



1, #e [0,7r/2], so 



^2 tr = ^2 tr M = g ( 16 cos2 ^ - 7) = y, (36) 



Numeric tests strongly suggest that this is the actual minimum. Most interestingly, the acclaimed 
minimizer V does not depend on i?! 
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and set 



V = 



\f2 2V6 
i _ /~3 

V2 V 8 



V 



%/2 2\/6 



2^2 
1 

2^2 



1 



^2 


z 


1 

V2 


1 

2^2 

i 

V2 




1 

i 




1 




1 

%/2 


i 

V2 

i 

V2 



2V2 



V2 



V3 ) 



independent of 1?! Then D s — diag (<j\, . . . , erg) with o\ = ■ ■ ■ = ag = costf, erg = 1, 
and G in (l35l) becomes 
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8 


3 
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3 
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3 
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3 
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3 
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-1 









3 
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3 
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3 
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-1 
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-1 


-1 


-1 


-1 


-1 


-1 


-1 


-1 
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^9x9 



Finally evaluating the target function provides the supposed minimum 



1 



U 5 u B T2 



= {a\Ga) = 1 (^(costf+l) 2 +3 



(37) 



with 1} defined by 

Interestingly, the smallest eigenvalue — ^ of G is of algebraic multiplicity 1 with 
corresponding eigenvector a($) evaluated at i9 — n/3 and coordinates — | (1, 1). Fur- 
thermore, (|37[) is minimal w.r.t. 1} exactly for 1!) — 0, which corresponds to maximal 
V = gjtr [UU] — 1 and <j\ = ■ ■ ■ = a d 2 = 1. In this case, D is the identity and 
U = V V T complex symmetric. Comparing with appendix IVII Dl notice that we 
obtain the same minimum value — 1|. 



D. Minimizing tr [U U T2 ] 

We explore the following minimization problem posed by Eq. ()25[) in section TV Bl 

(38) 



mm 



did : 



■tr 



uu T2 



: U £ B (H ® K) unitary i 



2G 



where TL £g> K, is the tensor product of two Hilbert spaces with dimensions d\ — dim TL 
and g?2 = dim/C, respectively, di being odd. The partial transposition is introduced 
in lVII Cl Note that any transformation 



u -> (v T <g> w/") [/ ® w 2 ") 



for unitary V d B (TL) and unitary Wi, W2 € $ (/C) leaves the target function invari- 
ant. If we allowed tensor products only, i.e. U — U\ ® U2, the target function would 
collapse to ^ tr [/7iZ7i] > — 1 + j-, which is in general strictly greater than ([55]) . see 
below. It is worth mentioning that (|38[) is inherently asymmetric w.r.t. d\ <-*■ c?2) as 
opposed to the previous section IVII Cl 



r -rrT- 



Proposition 19. U U ' is Hermitian for every minimizer U of (|38|) . 

Proof. As in previous sections, we differentiate the target function with respect to U. 
As U is unitary, X := \ dU must be Hermitian, and we get 



dU 



-tr 



UU 



■To 



i tr 
i tr 



xuu 



T 2 



i tr 



X [ UU T2 - h.c. 



T-2 



u Ti (xuy 



= 0. 



This holds for any Hermitian X , so UU must be Hermitian, too. 



□ 



Disassembly and reformulation. Let X = B (/C) be the Hilbert space equipped 
with the Hilbert-Schmidt inner product and induced Frobenius norm. By partitioning 
U as U = 5Zjj = i \i) (j\ <£> Uij with Uij G X, we can now reformulate the target 
function (f3"5)) as 



tr 



UU 



T 2 



The condition U unitary translates to 

UU j = 1 X w^fc = <5j-a V j, fe = 1, . . . , di 

i 

(u.ij \ u lk x) 8 jk (1 \x) = S jk trx Vj,/c; a; G A' 

i 

and the condition in proposition 1191 to 

[/ f/ T2 Hermitian uj^ = ■ 

<=> X (ujj I x Uki) = X (%i I x Uik) V j, fe; x € Af. 



Note that these equations can be rewritten in terms of the Hilbert-Schmidt inner 
product as shown. 
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Quaternion structure. In this paragraph we assume c?2 = 2 and set d = d±. 
Numeric tests suggest that in this case, the minimum value (|38[) is 

+ (39) 

Interestingly, there emerges a substructure which is best described by quaternions. 
Recall that quaternions 

H = {x + Xi i + x 2 j + x 3 k : x , ■ ■ ■ , x 3 G M} 

are a non-abelian division ring and form a 4-dimensional normcd division algebra over 
the real numbers. We regard R and C as subalgebras of H and denote the quaternion- 
conjugate of q = xq + x\ i + X2 j + X3 k € H by q*. Furthermore, define Reg := xq 
and q := q — Re q = x\ i + x% j ' + £3 k. 

To bridge the gap between quaternions and operators on Hilbert spaces, employ 
the identification 11 






/ / rr . = | j . j <-> iijy - 

Note that in this representation 

q ^ q = ( Xo + lXl x * + ix A eC ™ (40) 
y-x 2 +ix 3 x - ixij 

the quaternion conjugate is the Hermitian conjugate of the corresponds matrix, and 
the quaternion norm is the square root of the determinant, 



q*<^tf, |M| = y/dct(q). 

Let H d be the d-dimensional "vector space" over H with multiplication from the 
right, then each linear transformation on M d can be represented by a d x d matrix 
A € M dxd . The identification (|4"0)) provides an algebra isomorphism between M dxd = 
R dxd (gi H and the complex 2d x 2c? matrices consisting of 2 x 2 blocks (|40|) : to obtain 
Ax, define u, v £ C d by x =: u — j v and set x — (ui, v\, . . . , U4, Vd) € C 2d ; then 
Ax corresponds exactly to Ax. In the following it will be clear from context which 
representation is employed. 

For all A £ M dxd , the component-wise quaternion conjugate A* and the quaternion 
conjugate-transpose are intuitively translated to 

A*^A T \ A^Al 

Consequently, we say that A is Hermitian if A* = A. 

As in call A G C, Im A > an eigenvalue of A with corresponding eigenvector 
x £ M d if Ax = xX. These are exactly the eigenvalues of A which have nonnegative 
imaginary part. Note that most of the well-known linear algebra theorems can be 
generalized straightforward to quaternions. 



We adhere here to a different convention than e.g. [27 
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Proposition 20. Let A £ M dxd be Hermitian with eigenvalues — 1 ±d, the algebraic 
multiplicity of (— 1 + d) being 2±1 cmrf o/ — (1 + d) being respectively. Suppose 
A can be chosen such that Re A — 0, then there exists a unitary U € B (Tl <S> K.) with 



2d 



tr 



uu T ' 



2 

d 2 ' 



Proof. Since Re A = 0, we have A* = —A, and consequently A T = —A. Set U = 
i(l + A), embedding U dxd into C dxd <g> C 2x2 as described above, then U will be 
Hermitian and unitary since the eigenvalues satisfy X(U) = 4 (1 + (—1 ± d)) = ±1. 
Using U T2 = \ (1 + A*) = \ (1 - A), we get 

-Si? 4 '! 1 -^-?^-^- 1 ))- 1 *?- 

The isomorphism ([40|) introduces an additional factor 2 into the trace, which cancels 
■i in the last equation. □ 

Note that the conditions on A can be rephrased as follows. A £ H d><d is Hermitian 
with Re A = such that A 2 + 2 A - (d 2 - 1)1 = 0. The requirement Re A = implies 
the respective eigenvalue multiplicities via tr A = 0. 

To ensoul proposition l201 we provide explicit examples of A meeting all requirements 
for d = 3 and d — 5, namely 



and 







r o -i 
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I) 
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-2z 


-Vl2j 


2k 


-2j\ 


2« 








-2j 


4fc 


\/l2j 








— x/122 





-2fc 


2.7 


Vl2« 





-2i 


V 2, 


-4fe 


2/ 


/ 



respectively. Note that for these quaternion models, U is Hermitian, as well as U U , 
in agreement with proposition 1191 

Higher dimensions. Table |H] contains numerical results for different values of 
d\ and di . We have simply employed U — exp [i X] with Hermitian X to represent 
unitary matrices. The local convergence error is about 10 -6 , but it is still difficult to 
find the global minimizers. Quite remarkably, it seems that the lower bound —1 can 
be obtained for d% = 5 and cfo = 4, even if we restrict to real orthogonal matrices. 



2d 



-tr 



uu T2 



VIII. APPENDIX B-A SPECIAL EXTREMAL CHANNEL 

The following algebraic values for the coefficients /zi, . . . , /i4 of X in (O are ap- 
propriate; we have obtained them basically by guessing and suppose that at least 
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di 


d 2 = 1 


2 


3 


4 


5 


1 


1 


1 


1 


1 


1 


3 


i 

3 


7 
9 


23 
27 


_ 4 
5 


37 
45 


5 


3 
5 


23 
25 


-0.92915.. 


. -1 


-0.97632 . . . 


7 


5 
7 


47 
49 









TABLE II: Numeric solutions ol (|38p ; the columns correspond to different values of d 2 . The 
case d 2 = 1 is treated analytically and is a special case of Sec. IVIIi^l Note that d 2 = 2 is in 
agreement with (|39|l . and for di — d 2 = 3 we obtain the same value as the minimum of (|37|l 
with respect to $. 



polynomial degree 3 is required. 

Hi = i Rootx [-356 + 312a; - 66x 2 + 3a; 3 ] 1/2 
6 L J 

_ 1 / 22 _ 43 ■ 2 2 / 3 / r \ _ (977 + 213iV7) 1/3 / 

"8 ^" 3 (977 + 213 ? V7) 1/3 " + ^ ^ I 

= 0.21821, 
ju 2 = Rooti [-1 + 432.x 2 + 2592a; 3 ] 



1 l + 1 A, , n . /^\V3 

18 18(l + 3iA/7) ] 
= -0.14937, 



|p»-^( 1 - < ^)( 1 + 3< ^ 



1 

/i 4 = Root 2 [l - 6a; + 18x 3 ] 



M3 = g, 



^l + iV3) (-3 + zV7) 1/3 l-iV3 



6'2 2 / 3 3 (2 (-3 + i V7)) 1/3 

= 0.18595. 

The eigenvalues of X as calculated by a computer algebra program are then 

= jo, 0,77,7:, 0.23604, 0.097285 

with 

a = Root! [-25957 + 163107a; - 78003a; 2 + 6561a; 3 ] 

107 104 • 2 2 / 3 (1 + »V3) 13(l-iV3) (67 + 23iV7) 1/3 



27 27 (67 + 23, V7) 1/3 27 " 22/3 

= 0.17329. 
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In particular, X is positive semidefinite with rank equal to 4. Similarly, the linear 
independence of © can be verified explicitly. 
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